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Abstract. We survey the interplay between the Riesz means and Beurling 
moving averages of the title, obtaining Abelian and Tauberian results relating 
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1. Introduction 

Riesz (or typical) means have proved very useful since their introduction 
by Marcel Riesz in 1909. He used them for summing Dirichlet series, and so 
providing analytic continuation; see e.g. the Cambridge Tract by Hardy and 
Riesz [HarR], and the monograph by Chandrasekharan and Minakshisun- 
daram [ChaM III]. A Riesz mean R{ A, k ) has an order k > 0 and a type A, 
where A : M + —> R + is a function increasing to infinity; we write A n for A(n), 
and regard A as specified by these values. We confine attention here to order 
k — 1, and abbreviate R( A, 1) to R( A). 

Such Riesz means were used extensively by Karamata [Kar2] in the 1930s. 
Writing 



s —y c R( A) for 


he gave necessary and sufficient Tauberian conditions for such Riesz conver¬ 
gence to imply ordinary convergence s(x ) —* c as x —> oo. 

The whole held of Tauberian theory was revolutionised by Wiener in 1932 
[Wiel-3], who passed from special kernels to general kernels. The Wiener 
Tauberian theorem deals with convolutions (of a function with a kernel), 
which may be written multiplicatively (using the group (R+, x), and Mellin 
transforms) or additively (using the group (R, +) and Fourier transforms); 
for textbook accounts, see [Wie2], Hardy [Har4 XII], Korevaar [Kor, II], Wid- 
der [Wid, V], 
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Not all important Tauberian questions lend themselves to treatment in 
such a convolution framework, most notably that of the Borel summability 
method and its relatives ([Har4 VIII, IX], [Kor VI]). In unpublished lectures 
of 1957, Beurling gave an extension of the Wiener Tauberian theorem, allow¬ 
ing one to pass from statements of the form 

L s{u)K ^m)w)^ c L K(u)du {x001 

to similar statements with the kernel K replaced by H] here H, K are in 
Li(R), K is a Wiener kernel (its Fourier transform has no real zeros), S is 
bounded, and the function (j>{x) : R + —> R + is o(x) as x —» oo, measurable, 
and satisfies 

<f>(x + t(j)(x))/(j)(x) —> 1 (x —> oo) Vi e R; ( BSV) 

call such functions 0 Beurling slowly varying (notation: 0 e BSV), and 
call the left-hand side here the Beurling convolution with respect to 0 G 
BSV. Then Wiener’s Tauberian theorem extends to this new setting, to 
give Beurling’s Tauberian theorem , which reduces to it in the special case 
0 = 1; for a textbook account, see [Kor IV. 11]. 

If one takes for H the indicator function of an interval, the conclusion of 
Beurling’s Tauberian theorem takes the form 

X-\-C(f)(x) 

S{u)du —>■ s (x —> oo) Vc > 0. ( BMA 

We call the left-hand side here the Beurling moving average w.r.t. 0. We 
shall be principally interested in sequences (s n ); writing S(x) := ^2 n<x s n , 
the above becomes 

—rr V s n —>• s (x—too) Vc > 0. ( BMAa,) 

c<b(x ) 

x<n<x-\-c<j)(x) 

Such Beurling moving averages have many uses - for example, in prob¬ 
ability theory in connection with strong laws of large numbers - and have 
been studied from that point of view in e.g. [Binl-7], [BinGol,2], [BinGa], 
[BinM], [BinR], [BinT], They have also been studied recently from the point 
of view of Beurling slow and regular variation (see [BinOl-4]). This is related 
to the important subject of Karamata regular variation [Karl], to which we 
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return below; for textbook accounts see e.g. [BinGoT] (BGT for brevity), 
[Kor IV]. 

Our aim here is to study the inter-relationship between different Beurl- 
ing moving averages, as 0 varies (increases or decreases). As 0 increases in 
(. BMA 0 ), more averaging takes place, and this suggests that matters improve 
and convergence is preserved. It has been known since [Binl] that Beurling 
averages are special cases of Riesz means. For Riesz means R( A), matters 
improve as A decreases. Furthermore, R( A) and R(p) are equivalent if log A 
and log n are comparable (0 < lim inf log A/ log p, < lim sup log A/ log /x < 00 ). 
This is the second consistency theorem for Riesz means ([HarR Th. 17] for 
the case A i-> log A, [Har2, §8], [Har3] in general; [ChaM II]). This becomes 
transparent in our context of Beurling moving averages, as we shall see in §2 
below. 

2. Results 

We shall need local uniformity - that is, that ( BSV ) holds uniformly on 
compact t-sets. For this, we need to impose a condition on 0, and we have 
several choices here. 

1. We can take 0 continuous. Then uniformity holds by Bloom’s theorem of 
1976 ([Bio]; BGT, §2.11; [BinOl]). 

2. We can take 0 monotone [BinOl]. 

3. We can take 0 measurable and with the Darboux property (intermediate- 
value property): [BinOl]. 

4. We can take 0 with the Baire and Darboux properties [BinOl]. 

These are all quite weak; indeed, continuity and monotonicity usually hold 
from the context in practice. 

When one has local uniformity in (BSV), one writes 0 G SN, and calls 
0 self-neglecting ; we restrict attention to such 0 below. 

We have mentioned the relevance of Karamata’s theory of regular varia¬ 
tion in §1 above (see e.g. BGT Ch. 1). Here the main result is the uniform 
convergence theorem (UCT): if 

f(\x)/f(x) ->• g( A) (x -)> 00 ) VA > 0, (K) 

then the convergence is uniform on compact A-sets in (0, 00 ) if / has the Baire 
property (‘is Baire’) or is measurable, and the limit g has the form g( A) = X p 
for some p (the characterisation theorem). There is a similar UCT in the 
Bojanic-Karamata/de Haan theory of regular variation (see e.g. BGT Ch. 
3, and for the Bojanic-Karamata contribution, [BojK]). Whereas Beurling 
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slow variation originally appeared as a minor topic in Karamata theory (see 
e.g. BGT §2.11), it is now known that Benrling slow and regular variation 
include the Karamata and Bojanic-Karamata/de Haan theories [BinOl]. It 
thus shares their feature, of having a representation theorem: one can give 
an explicit description of all functions in ( BMA ^), in SN, etc. Our results 
follow simply from this. The functions in SN have representations of the form 

f*X 

<ft{x) = c(x) / e(u)du 

Jo 

with c(.) —* c G (0, oo), e(.) —> 0 and the product positive (BGT §2.11; 
[BinOl]); by uniformity in ( BMA ^), one may take (ft(x) = c f Q v e(u)du, so 
(ft'(x) = ce(x) —>• 0. 

We quote the following results from [BinGo2, Th. 1,2] and [BinGol, Th. 
3] respectively. The first links the two themes of our title, the second is a 
representation theorem. 

Theorem BGl. For <ft e SN, write 

/ X 

du/(ft(u). 

Then ( BMA for a sequence (s n ) implies (i?(A)) and (R(fj)), where 

\{x) := 4>(x) exp{<F(a;)}, p(x) := exp{<F(a;)}. 

Note. In the second consistency theorem, what is crucial is log A. This is 
log <f> + <h, and as (p(x ) = o(x), log (ft is o(logx). But <h(a;) = du/(ft{u) is of 
order greater than J]’ du/u = logo:, so log A ~ log/a = <F. Thus comparabil¬ 
ity of two ^-functions gives comparability of their ^-functions on integrating, 
so of their (log p or) log A functions, so equivalence of the Riesz means. Fur¬ 
ther, convergence improves as (ft increases, that is, A decreases, as expected 
in §1. 

Theorem BG2. The function U satisfies ( BMA ^) iff it is of the form 

/ X 

(1 + 7](u))du with rj(.)0. 

This provides a transparent proof of the following result, which links 
Beurling moving averages BMA^ with different self-neglecting functions (ft. 
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Theorem. If <fi, ifi e SN with <fi < f: 

(i) U —> c (. BMA implies U —> c (. BMA ^). 

(ii) The converse holds iff there exists a function 8 —>• 0 such that V U + 5 
satisfies (. BMA$). 

Proof. The direct assertion (i) just says that off) is off)] this is Abelian. The 
converse assertion (ii) says that 'if there is an o(l)-perturbation V — U + 5 
of U satisfying ( BMA )^, then so does U. This is just linearity in (. BMA)$. 
Also, this perturbation condition is necessary as well as sufficient (if the con¬ 
clusion holds, the perturbation condition holds with 5 = 0), and is thus a 
necessary and sufficient Tauberian condition. // 

This result, though nearly immediate given the representation theorem 
Th. BG2, has distinguished antecedents. For the Abelian result (i), the 
background is (what became known as) the second consistency theorem for 
Riesz means (§1); this emerged in work of Riesz in 1909 and 1916 [Riel,2], 
Hardy [Har2,3] in 1910 and 1915, [HarR Th. 17] in 1915. For the Tauberian 
result (ii), the origin is Hardy’s result of 1904 ([Harl, §6, p. 40]; [Har2, 55; 
Comments, 83]; [Har4, Th. 149]), that Cesaro convergence with rate off/\fn) 
gives Borel convergence. (This is extended to general order k for Borel’s in¬ 
tegral method by Hardy and Littlewood in 1916 [HarL, §3].) It was further 
extended to Euler convergence by Knopp in 1923 [Kno]. 

Hardy and Littlewood introduced a family of summability methods in 
[HarL], for the purpose of analytic continuation by power series [Har4, VIII]. 
They called these the (e, c) methods, though the term Valiron methods is 
now used; there is a textbook treatment in [Har4, IX, 9.10, 9.11] in 1949. 
Also in 1949, Meyer-Konig [MeyK] gave an extended treatment of the cir¬ 
cle methods ( Kreisverfahren: methods of Euler, Borel, Valiron, Taylor and 
Meyer-Konig; there is a modern textbook account in [Kor, VI]); unfortu¬ 
nately, this came too late for inclusion in [Har4], The Riesz method linked 
to the Borel, Euler and circle methods is Rfe^™). This is equivalent, both 
to the Beurling moving average BM^ and to Cesaro convergence with rate 
o{l/y/n) of some o(l)-perturbation as in (ii) of the Theorem [Bin2,3]. The 
Abelian result that convergence under Rfey™) implies Euler (and so Borel) 
and Valiron convergence follows from Knopp’s result in the Euler case and 
work of Hyslop [Hys] in the Valiron case, Parameswaran [Par] in the circle 
case; see also [Bin2,3]. The Tauberian converse is proved for ,s n non-negative 
(equivalently, bounded below: s n = Ol( 1)) by Tenenbaum [Ten], It is given 
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under a best-possible one-sided Tauberian condition in [BinGol]; cf. [Binl], 
[BinT], That all these methods are equivalent for bounded sequences is due to 
Meyer-Konig [MeyK]; for a simple proof in the Eulcr-Borel case, see Jurkat 
[Jur], Equivalence of all these methods for bounded sequences follows from 
Beuiiing’s Tauberian theorem (§1; see [Kor, IV. 11]), developed by Beurling 
for this purpose. 

For 0 < (5 < 1, equivalence of the Riesz mean R(exp{n 1_/3 }) with BMA n p 
and with perturbed Cesaro convergence with rate o(l/n 1_/3 ) is in [BinT, Th. 
3]. One can extend this to include fd = 1, by using i?(exp{J] n dx/x 13 )) (see 
§3.4 below). Part of this result was obtained much earlier by Wang [Wan], 

The one-sided Tauberian condition above derives from work of Erdos and 
Karamata [ErdK] in 1956 and Bojanic and Karamata [BojK] in 1963; see 
BGT §3.8 for a textbook account. 


3. Remarks. 

1. Riesz means and their summability. 

The theory of Riesz means owes its development (by Riesz, 1909-1916: 
[Riel-3]) to the intense interest in Dirichlet series in general and the Riemann 
zeta function in particular in the early 1900s, following the proof of the Prime 
Number Theorem in 1896. For background, see e.g. Landau [Lan, Sechstes 
Buch], [HarR], [BohC, 1.5,6]. For example, the Riemann zeta function cannot 
be extended by any Cesaro method of summability [Har2, §6]. However, the 
alternating zeta function ^^°(—)" _1 /n s is summable by Cesaro means of 
order k if a := Re s > —k (Bohr [Bohl, II.3 Th. I]; [HarR, 20, 24]), and as 
Sr ( — ) n ~ 1 / nS = C( s )(l — 2 1-s ), this can be used to continue (. See [HarR 
IV] for general results, and [HarR VI] for Abelian and Tauberian theorems. 

2. The logarithmic method. 

One writes 

1 n 

1- y^ s k/(k + 1)s (n —>■ oo) 

log n 


as 

s n t s (£), 


and says that s n converges to s under the logarithmic method of summability. 
Then £ is the Riesz method i?(log n) (see e.g. [Har4, Th. 37]). As log n grows 
more slowly than n, this method is less restrictive than the Cesaro method 
C — C\ — R(n), while R(e ^”) is more restrictive. To summarise: 


i?(exp{i/n}) C R(n) C R(logn). 
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The logarithmic method has recently been studied in detail by the author 
and Gashi [BinGa]. One obtains moving averages there too, but these are 
not of the Beurling type of §§1,2, [BinGol,2] etc. 

3. Regular variation. 

As noted above, Beurling regular variation [Bin04] contains both the 
original Karamata regular variation and its later development by Bojanic 
and Karamata and by de Haan. A structural feature of any theory of regular 
variation is a representation theorem. As the simple proof of our Theorem 
clearly shows, it is this that is crucial here, ft is precisely the lack of any 
awareness of this aspect - that once one can identify regular variation in 
some form, a complete and explicit description of the function in question is 
available - that delayed the solution of the problems considered above. 

4. Laws of Large Numbers (LLNs). 

The central result of probability theory is Kolmogorov’s strong law of 
large numbers (SLLN) of 1933 [Kol]: for X,Xi,..., X n ,... independent and 
identically distributed (iid) random variables, the sample mean S n /n : = 
n Xu Xk converges with probability one (almost surely, a.s.) to some con¬ 
stant c iff X G L i, i.e. A[|X|] < oo; then c = E[X\, the population mean. 
The summability method here is the Cesaro method C — C\ ; the moment 
condition here is membership of L±. 

This result has variants of many kinds; the one that concerns us here 
is alternatives in which both the summability method and the integrability 
condition are changed. Both are strengthened if one works with 

/ n 

dx/x 1 ^}) (p > 1); 

it turns out that the corresponding integrability condition is membership of 
L p (see §3.7 below). The most important case is p = 2 discussed above, 
where the method is f? 2! equivalently R(e^), and the integrability condition 
is existence of a variance (as distinct from Kolmogorov’s SLLN for p — 1, 
which is about means). It turns out that the Euler and Borel (and Valiron) 
methods are interchangeable with R -2 here; for details and references, see e.g. 
[BinT], But one is not restricted to powers here, as is shown in [BinGo2, §3]. 
There, the summability method is R( A), with A(x) := expjj) 1 du/(f{u)} with 
(j) G SN] the integrability condition isE[^(|X|)] < oo with if := <f^~ and if' 
assumed of bounded increase (roughly, polynomial growth). This restriction is 
necessary for probabilistic reasons (a.s. invariance v. non-invariance, Erdos- 
Renyi law, etc.). The proof hinges on maximal inequalities , as usual with 
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strong limit theorems in probability theory. Again, one sees that, with two 
comparable functions 0 (or 0 := 0**“), it is the bigger 0 (smaller 0, bigger 
A := explJj 1/0}) that is the more restrictive. 

By contrast, both summability and integrability are weakened when we 
pass from C to the logarithmic method. It turns out that this case is very 
interesting and important, because of its connection with the almost sure 
central limit theorem (ASCLT). We refer to [BinR], [Bin6], [BinGa] for details 
and references here. The logarithmic method is also important in analytic 
number theory, in connection with the Prime Number Theorem and with 
densities of sets of integers; again, see [BinGa] for details. 

5. CLT; LIL, LSL. 

The second main pillar of probability theory is the central limit theorem 
(CLT: the folklore names of LLN and CLT are the Law of Averages and 
the Law of Errors). This too has a counterpart in this context, due to 
Embrechts and Maejima [EmbM], Intermediate between the two is the law 
of the iterated logarithm (LIL). This applies to the Cesaro method, C = Rp 
its counterpart for R 2 contains only one logarithm, and is called the law of 
the single logarithm (LSL); see e.g. [BinM], [Bin5] for details and references. 

For R p , Gut, Jonsson and Stadtmiiller [GutJS] obtain a result which is, in a 
sense, between LIL and LSL. 

6. Tauberian theorems between Riesz means. 

We mention here the work of Jurkat, Kratz and Peyerimhoff [JurKP], 
Here, given three Riesz means _R(Aj,/q) ( i = 1,2,3), if two Riesz means of 
a function have given bounds, best-possible bounds are given for the third. 
These results are used in [Bin5] to show that for p > 1, the Riesz mean R p 
is strictly more restrictive than the Cesaro mean C\/ p . 

7. Discontinuity across p = 1. 

The Cesaro methods C a for a > 1 are equivalent in the context above of 
LLNs (to each other and to the Abel method A): 

A G L\ "vv - X n —t* p a.s. (Ci) X n —^ p a.s. (C Q ) (o? ^ 1). 

By contrast, for a < 1 (a — 1/p above) the integrability required is the more 
stringent Li/ a : 

X G Li/ a "vv - X n —t* p a.s. ( R\/a ) X n —^ p a.s. (C a ) (0 <C cc ^ 1) 

see [Bin5]. This striking discontinuity reflects the distinguished role played 
by the Kolmogorov SLLN among results of this type. This is seen also in the 



different nature of the LLNs for the logarithmic method [BinGa], where one 
assumes less integrability than in the SLLN, rather than more as above. 

Despite this, there are two results which handle L p with p < 1 and p > 1 
together (though there is a difference even there: one centres at the mean 
where the mean exists): the Marcinkiewicz-Zygmund law and the Baum- 
Katz law [MarZ], [BauK], For a textbook account, see [Gut, 6.7, 6.12.1]; for 
a survey of extensions to the strong law generally, see [Bind], 

8. Self-neglecting and self-equivarying functions. 

The self-neglecting functions <f G SN of §1 have <f>{x) = o(x ) (by defini¬ 
tion, or from the representation). It turns out that one may usefully extend 
this to 4>(x) = O(x), but no further; the resulting functions are called self- 
equivarying functions, forming the class SE. It is this extension from SN 
to SE that enables the Karamata and Bojanic-Karamata/de Haan theories 
of regular variation to be subsumed within that of Beurling regular vari¬ 
ation [BinOl], [Ostl,2], But this shows why the logarithmic method lies 
outside the Beurling framework: \(x) = logo; = exp{ f* du/(j)(u)} would 
give 4>(x) = a: log a:, which is not 0(x). 

9. Systems of kernels. 

We saw in the Beurling Tauberian theorem the use of H := I[- c ,o]■ This 
cannot be used as a Wiener kernel, as its Fourier transform has zeros 2nn/c. 
But for two such Hi with q incommensurable, there are no common zeros, 
and in the Wiener Tauberian theory (and its Beurling extension) this suffices. 
See [BinI2] for a case in point (the motivation is analytic number theory, in 
[Binll]). For by Kronecker’s theorem (see e.g. [HarW, XXIII, Th. 438]) the 
additive group generated by ci,C 2 is then dense in M. This bears on quanti¬ 
fier weakening: the quantifier V in BMA$ may be weakened to just two such 
incommensurable cs. This is a common theme in regular variation; see BGT 
Th. 3.2.5, and the recent [Bin02], 

Postscript. This paper was written in the summer of 2014, shortly before 
the centenary of the outbreak of World War I. It will probably appear in 
2015, the centenary of the Hardy-Riesz Tract (from the Preface, by Hardy, 
19 May 1915: ‘The publication of this tract has been delayed by a variety 
of causes, and I am now compelled to issue it without Dr Riesz’s help in the 
final correction of the proofs’: Riesz, who was Hungarian, was in Stockholm). 
We recall their unforgettable dedication: 

Mathematicis quotquot ubique sunt operum societatem nunc diremptam 
mox ut optare licet redintegraturis d.d.d. auctores hostes idemque amici. 


9 



[To mathematicians, however many and wherever they may be: that they 
may soon again take up, as is to be hoped, the fellowship of their work now 
disrupted, we the authors, enemies and yet also friends, present and dedicate 
(this book).] 
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